Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

نویسندگان

  • Benjamin Cauchi
  • Ina Kodrasi
  • Robert Rehr
  • Stephan Gerlach
  • Ante Jukic
  • Timo Gerkmann
  • Simon Doclo
  • Stefan Goetze
چکیده

This paper presents a system aiming at joint dereverberation and noise reduction by applying a combination of a beamformer with a single-channel spectral enhancement scheme. First, a minimum variance distortionless response beamformer with an online estimated noise coherence matrix is used to suppress noise and reverberation. The output of this beamformer is then processed by a single-channel spectral enhancement scheme, based on statistical room acoustics, minimum statistics, and temporal cepstrum smoothing, to suppress residual noise and reverberation. The evaluation is conducted using the REVERB challenge corpus, designed to evaluate speech enhancement algorithms in the presence of both reverberation and noise. The proposed system is evaluated using instrumental speech quality measures, the performance of an automatic speech recognition system, and a subjective evaluation of the speech quality based on a MUSHRA test. The performance achieved by beamforming, single-channel spectral enhancement, and their combination are compared, and experimental results show that the proposed system is effective in suppressing both reverberation and noise while improving the speech quality. The achieved improvements are particularly significant in conditions with high reverberation times.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wavelet Transform Extrema Clustering for Multi-channel Speech Dereverberation

This paper presents a method for enhancing multi-channel reverberant speech using event-based processing of wavelet transform coefficients. Clustering of the wavelet extrema across multiple channels is employed to obtain a single multi-scale extrema representation from which the enhanced signal is synthesized. Processing is done in the LPC residual domain, with the entire analysis being precede...

متن کامل

Enhancement of Reverberant and Noisy Speech by Extending Its Coherence

We introduce a novel speech enhancement algorithm for removing reverberation and noise from recorded speech data. Our approach centers around using a single-channel minimum mean-square error log-spectral amplitude (MMSELSA) estimator, which applies gain coefficients in a timefrequency domain to suppress noise and reverberation. The main contribution of this paper is that the enhancement is done...

متن کامل

A New Post-filter Algorithm Combined with Two-step Adaptive Beamformer

The optimal microphone array, in the sense of minimum mean square errors (MMSE), includes two processing blocks: the minimum variance distortionless response (MVDR) beamformer and the single-channel Wiener filter, which acts as post-filter. In this paper, we propose a new post-filter algorithm based on assumptions that both the noise power attenuation factor (NPAF) and signal power attenuation ...

متن کامل

Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment

A blind dereverberation method based on spectral subtraction using a multi-channel least mean squares algorithm was previously proposed. The results of a large vocabulary continuous speech recognition task showed that this method achieved significant improvements over the conventional method based on cepstral mean normalization and beamforming in a simulated reverberant environment without addi...

متن کامل

Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence

Most speech enhancement algorithms make use of the short-time Fourier transform (STFT), which is a simple and flexible time-frequency decomposition that estimates the short-time spectrum of a signal. However, the duration of short STFT frames are inherently limited by the nonstationarity of speech signals. The main contribution of this paper is a demonstration of speech enhancement and automati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • EURASIP J. Adv. Sig. Proc.

دوره 2015  شماره 

صفحات  -

تاریخ انتشار 2015